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Abstract 

o 

CnJ ' We demonstrate the use of a new, control-oriented notion of finite state approximation for a 

C^i , particular class of hybrid systems. Specifically, we consider the problem of designing a stabilizing 

binary output feedback switching controller for a pair of unstable homogeneous second order systems. 
The constructive approach presented in this note, in addition to yielding an explicit construction of 
' a deterministic finite state approximate model of the hybrid plant, allows us to efficiently establish a 

useable upper bound on the quality of approximation, and leads to a discrete optimization problem 
whose solution immediately provides a certifiably correct-by-design controller for the original system. 
The resulting controller consists of a finite state observer for the plant and a corresponding full state 
feedback switching control law. 
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I. Introduction 

A. Motivation and Overview 



CSJ ■ Finite state approximations and abstractions of hybrid plants have been explored as a means 

for addressing instances in which the interaction between analog and discrete dynamics^ leads 
to complex analysis and synthesis problems that cannot be adequately handled by traditional 
methods. Some of the early work in this area explored the use of 'qualitative' models, namely 
non-deterministic automata whose output behavior contains that of the original hybrid system 
[19]. In [21], an approach for deriving non-deterministic finite state approximations of systems 
with quantized outputs was proposed and used in conjunction with the supervisory control 
theory developed by Ramadge and Wonham [22] to design controllers meeting the desired 
specifications. Another line of research, inspired by formal verification techniques in computer 
science, explored the construction of deterministic discrete abstractions of hybrid systems [2]. 



Early work in this area focused on identifying classes of systems that admit finite bisimulation 



abstractions [11], [15]. Having recognized that these classes are fairly limited, more recent 
work has focused on finding discrete abstractions that are related to the original hybrid system 
by an approximate simulation or bisimulation relation [10], [25], [26]. The resulting controller 
design problem is then a two step procedure in which a finite state supervisory controller is 
first designed, and subsequently refined to yield a certified hybrid controller for the original 
plant [27]. 

In this note, we demonstrate the use of a ne alternative notion of approximation that, 
in addition to yielding a finite state deterministic approximate model of the original hybrid 
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system, allows one to efficiently establish a useable upper bound on the quality of approx- 
imation, and leads to a discrete optimization problem whose solution immediately provides 
a certifiably correct-by-design (finite state) controller for the original system. The notion of 
finite input/output approximation employed in this note is thus potentially better suited to the 
purpose of control design than the existing methods since it allows for a streamlined and 
efficient synthesis procedure. Another promising development is that, in contrast to many of 
the existing methods and our early work on this problem, we have succeeded in approximating 
unstable systems by finite state automata for the purpose of control design. 

We demonstrate this approach in a particular setting, namely for the problem of designing a 
stabilizing, binary output feedback, switching controller for a pair of unstable, homogeneous, 
second order systems. This problem was specifically chosen because several of its simpler 
formulations have been extensively studied and are well understood [7], [17]. Necessary and 
sufficient conditions for stability of switched second order homogeneous systems were derived 
in [9]. A Lyapunov based approach for designing stabilizing full state feedback switching 
controllers for second order homogenous systems was proposed in [12]. While the results 
available for general homogeneous systems remain limited, the special case of switched linear 
systems has been more extensively studied. For instance, in the linear full state feedback analog 
case, the existence of a Hurwitz convex combination (A eq = a,A\ + (1 — a)A 2 , a £ (0, 1)) 
of a given pair of unstable state matrices (Ai and A 2 ) is known to be a necessary [8] and 
sufficient [34] condition for the existence of a quadratically stabilizing switching controller. 
The switching controller can take on a variety of forms in this case (time-switched controller, 
hybrid controller, etc.). In particular, when A eq has a real eigenvalue, a quadratic switching 
surface and corresponding state dependent variable structure control law [13] can be designed 
to stabilize the system [34]. In this case, once designed assuming full state feedback, the 
stabilizing controller can be exactly implemented by transmitting appropriately chosen binary 
state information from the plant to the controller, namely information that the state is currently 
in one or the other region of the state-space. Lyapunov based approaches have also been 
extended to time-sampled versions of the problem [6], [18], and to the output feedback case 
[24]: Typically the output is assumed to be a linear function of the state, and the resulting 
controller is observer-based [8]. In contrast, the setting considered in this note where fixed, 
binary sensors are used leads to a more difficult state estimation (and thus controller synthesis) 
proble This case has been much less well studied, and has only begun to receive attention 
in the recent past [4], [20], [23]. Specifically, the use of finite state approximations in this 
context remains minimally explored. 

The constructive controller synthesis approach can be summarized as follows: First, the plant 
is approximated by a deterministic finite state machine and a usable bound on the quality of 
approximation is efficiently established. Next, a switching control law is designed to robustly 
stabilize the nominal finite state machine model in the presence of admissible approximation 
uncertainty. The resulting controller, which is thus certifiably correct-by-design, is then also a 
finite state system. Additionally, the procedure by which it is synthesized is computationally 
tractable in the sense that it is based on & finite state nominal model. 

Practically, this design approach may be utilized in applications where very coarse sensors 
are used to keep operating cost, weight or power consumption low. Looking ahead, having a 
systematic procedure for designing control systems specifically for the case where the plant 

3 This setup differs significantly from the variable structure controller setup described above, as the binary sensors are 
assumed to be fixed a priori with no regard to the system dynamics. As such, there generally is a mismatch between the 
information provided by the sensors and the binary information needed to implement a variable structure switching law. The 
resulting state estimation problem is at the core of the design challenge here. 
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and controller interact through finite alphabets points to new paradigms for control over 
networks in which the amount of information to be encoded and transmitted over communi- 
cation channels is significantly reduced. Finally, this design procedure may be employed by 
artificially imposing coarse measurements on a switched system for which there is no other 
available or tractable approach for synthesizing controllers, though the relevant question of 
how to best impose a finite measurement quantization for the control objective of interest is 
not addressed in this paper. 

B. Organization of the Paper 

The paper is organized as follows: The statement of the control design problem and an 
overview of the controller design procedure are described in Section HO The algorithms 
proposed for constructing a finite state approximation of the plant, and for computing an 
a-posteriori bound on the resulting approximation error are presented in Section Unl Design 
of the robust stabilizing switching law and the structure of the resulting switching controller 
are addressed in Section [W] Illustrative examples are presented in Section |VJ The paper 
concludes in Section |VT1 where recommendations for future work are given. 

C. Notation 

The notation used in the paper is fairly standard: Z+, R and R + denote the sets of non- 
negative integers, reals and non-negative reals, respectively. For real interval I = [a, b), \I\ — 



\b—a\ denotes the length of /. Given a vector v G M n , v' denotes its transpose and ||i>|| = \mJ 
denotes its Euclidean norm. Given sets A and B, card(A) denotes the cardinality of A, Ax B 
denotes their Cartesian product, A z+ denotes the set of all infinite sequences taking their 
values in set A and (boldface) a denotes an element of A z+ . Given a function / : A — > B 
and a proper subset A' C A, f\&> denotes the restriction of / to A', that is the function 
/| A' : A' — > B defined by /|^/(a) = f(a) for a G A'. Given two functions / : A — > B and 
g : B — )■ C, their composition, denoted by g o f, is a function g o f : A — >■ C defined by 
g o /(a) = g(f(a)). Given two real-valued functions / : A — > R and g : A — > R, the notation 
/ < g is understood to mean that f(a) < g(a) for all a G A. 

II. Problem Statement & Overview of the Design Procedure 
A. Problem Statement 

Consider a discrete-time plant P described by 



where the time index t G Z + , state x(t) G M 2 , performance output v(t) G M, and control 
input u is binary with u(t) G U = {0,1}. Sensor output y is also taken to be binary, 
y(t) G y = { — 1,1}, with the understanding that when c'x = 0, y is taken to be +1 in one 
quadrant and —1 in the other. Functions f u : M 2 — >■ M 2 and vector c G M 2 , c ^ 0, are given. 
The following conditions are assumed to hold for each mG {0,1}: 

1) f u is continuous. 

2) /„ is homogeneous with degree 1: That is, f u (ax) = af u (x) for all a G R, x G R 2 . 





(1) 



(2) 




(3) 
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In this setup, the effect of the control action is to pick a choice of system from a given pair 
and to hold that choice until the next time step, at which point a new measurement is available 
and a new choice of system is made and implemented. 

The objective is to design a controller K C y z+ x U z+ such that the closed loop system 
(P, K) with output v (Figured)) satisfies the following performance objective for some R > 0, 
for any initial condition of plant P: 



± 

sup (v(t) + r) 

1 - t=0 



< oo 



(4) 



Satisfaction of this performance objective guarantees that the state of the closed loop system 
(globally) exponentially converges to the origin, on average, at a rate not less than R. 



u(t)e {0,1} 




y(*)e{-l,l} 




Fig. 1. Closed loop system. 



B. Overview of the Controller Design Procedure 

Design of the stabilizing controller is an iterative procedure, with each iteration consisting 
of a sequence of steps. First, the plant P is approximated by the interconnection of a finite state 
machine M and a complex system A representing the approximation error (Figure [2]). Next, 
a meaningful and useable "gain bound" is established for the approximation error, system A. 
Finally, an attempt is made to synthesize a feedback switching law for the nominal finite state 
model M, that is robust to the approximation error A. If synthesis is successful, the resulting 
controller is guaranteed to globally exponentially stabilize the plant at some verified rate R. 
Otherwise if synthesis is unsuccessful, or if the verified rate is unsatisfactory, a more refined 
approximation (meaning a finite state machine with a larger number of states) is sought for 
the plant and the above process is repeated. 



u(t) E U 



P 



v(t) E 



w(t) E W 



y(t) e y 



A 



u(t) E U 



M 



u(t) E U 



v(t) E V 



Fig. 2. Plant P, its finite state machine approximation M, and the corresponding error system A. 
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III. A Finite State Approximation of the Plant 

A deterministic finite state machine (DFM for short) is understood to be a discrete-time 
dynamical system (independent variable t E Z + ) described by a state transition equation © 
and an output equation © 

q(t + l) = f(q(t),u(t)) (5) 
y(t) = g(q(t,u(t))) (6) 

State q(t) E Q, input u(t) E U, and output y(t) E y, where Q, U and y are finite state, input 
alphabet and output alphabet sets, respectively. 



A. A Notion of Approximation 

Given a plant P with binary control input u{t) E U = {0, 1}, binary sensor output y(t) E 
y = {—1,1} and upper bounded^ performance output v(t) E R. Consider a system M with 
inputs u(t) E U and w(t) E W = {0, 1}, and with outputs u(t) E U, y(t) E y and v(t) E V, 
where V is a finite discrete subset of R. Suppose that M has the internal structure shown 
in Figure |3l where M is a deterministic finite state machine. To ensure well-posedness, we 
require that there be no direct feedthrough from input y to output y in M: In other words, y(t) 
is only allowed to be a function of the input u{t) and the state of M at time t. Memoryless 
system <p is defined by 

y if w = 
— y if w = 1 



y 



System M is thus a deterministic finite state machine. 

Consider also the corresponding system A shown in Figure HI Memoryless system \1> is 
defined by 

"0 if y = y 
1 if y ^ y 



w 



System A is not a deterministic finite state machine in general, unless plant P is one. 
Moreover, the existing requirement that there be no direct feedthrough from y to y in M 
ensures that the outputs y of P and y of M cannot be trivially matched. 




Fig. 3. Internal structure of M, the deterministic finite state machine approximation of P. 



Now consider the interconnection, as in Figure [2l of M and A with these particular 
structures, and suppose that the two copies of M are identically initialized. Regardless of 
the choice of M, it can be seen by direct inspection that for arbitrary initial conditions of P 



4 It will become clear in Section UlI-BI that v is indeed upper bounded for the class of systems under consideration. 
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and for any input u G W z+ , the corresponding outputs y G y z+ of P and y G 3^ z+ of (M, A) 
are identical. 

Suppose that we can construct M such that the following two conditions are satisfied for 
any input u G U z+ and any initial condition of P: 

1) Outputs v G V z+ of the interconnection (M, A) and v G M z+ of P satisfy 

t)(*) > v(t), for all t G Z+. (7) 

2) For p : W — 7- {1} and : W — > IR+ defined by = w, there exists a constant 
7 G (0, 1) such that every feasible input/output pair of signals (u, w) G U z+ x W z+ of 
the error system A satisfies the gain condition: 

T 

inf^>(«(*)) -M^(t)) > -oo. (8) 



t=o 



The resulting deterministic finite state machine M is then said to be a finite state approximation 
of P, and the corresponding system A is said to be the approximation error. The following 
remarks aim to clarify the reasoning behind this approach to plant approximation. 



u 



L 



P 



M 



A 



w 



Fig. 4. The error system A of M and P. 



Remark 1: The structure of the approximation error A proposed in this setting is signif- 
icantly different from that of the approximation error in the traditional stable LTI model 
reduction setting. In that setting, a stable lower order LTI model is considered to be a good 
approximation of the original stable LTI system if the outputs of the two systems are not too 
different, in a worst case sense, when driven side by side by the same input. The structure 
proposed here, where the output of the plant P is fed back to M, is needed because the 
original system is not stable: Two copies of P initialized differently and driven side by side 
may end up with different outputs at every step. Thus, there is a need to explicitly 'estimate' 
the initial condition of P, otherwise there is no hope for satisfying gain condition © for any 
choice of M. 

Remark 2: The first condition characterizing a valid finite state approximation, namely the 
inequality in ©, is motivated by the control objective © at hand. In particular, if © holds 
and a controller ip C 3^ z+ X W z+ is designed to ensure that the closed loop system (M, A, ip) 
satisfies the auxiliary performance objective 

T 

sup y~)v(t) + R < oo, (9) 
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then a corresponding controller implemented in feedback with plant P is guaranteed to 
satisfy performance objective ©: We will return to address this statement in more detail in 
Section [IV] Upper boundedness of signal v ensures that © can always be satisfied for some 
appropriate finite choice of V. Intuitively, a better approximation in which the instantaneous 
difference between v and v is smaller is desirable and leads to a less conservative controller 
design. 

Remark 3: The second condition characterizing a valid finite state approximation, namely 
the 'gain condition' describing the approximation error A, is compatible with the framework 
and the tools for robustness analysis of systems over finite alphabets developed in [31]. The 
choice of functions p and \i proposed here are specific to the control objective at hand. 
In particular, the smallest value of 7 for which ([8]) holds (the 'gain' of A) represents the 
fraction of time (computed over an infinite window) that the outputs of P and M disagree 
in the worst-case scenario. A smaller value of 7 is thus desirable and indicative of a better 
approximation. 

In the following sections, a constructive procedure for generating a viable M and a com- 
putationally efficient algorithm for computing an a-posteriori upper bound on the gain of the 
resulting error system A are proposed. 



B. Construction of the Nominal Model 

The approach proposed for constructing a nominal finite state model to approximate plant 
P takes advantage of the dynamical properties specific to homogeneous systems, evident after 
a coordinate transformation; interested readers are referred to [28] for an overview of other 
potential approaches. Let 



x 



Xi 

x 2 



C-2 



/«(■) 



/«i(0 



In a polar coordinate system where r = \fx\ + x\ and 9 = tan ( — J , the dynamics of 
system P described in ©, (O and © are given by: 



r(t + l)=r(t) f uit) (/3(9(t))) 

e(t + r 

y(t) = sign{c'P(9(t)) 



v(t) = log( f u (t)(mt))) 



(10) 

(11) 
(12) 



where 



0(9) 



cos 9 
sin 9 



This coordinate transformation highlights two important properties of the class of systems 
under consideration: 

1) The evolutions of the angular coordinate 9 and of both outputs of P are independent 
of the radial coordinate r: The state of the system relevant to the stabilization problem 
at hand effectively evolves on the unit circle. 
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2) For brevity of notation in the following argument, denote the composition j3 o 9 simply 
by f3. For each t G Z+, we have (5{t) G [-1, 1] x [-1, 1] = R u f u (t)W(t)) e fu(t)(Ri) = 
where i?° and -R-j, ^ again compact in M 2 since each /„ is continuous. Thus we 

have 

\\Mt)W))\\< IMI = M 

x£R%,ue{0,l} 

and 

v(t) = log(\\f u{t) (/3(t))\\) < logM, for all t G Z+. 

It thus follows that t> is upper bounded. 
Compactness of the effective state set and upper boundedness of the performance output v are 
instrumental in ensuring that this class of plants is amenable to a finite state approximation. 

M is constructed by partitioning the unit circle into a collection of intervals and defining the 
potential states of M to correspond to the intervals and the unions of adjacent intervals. This 
partition generally need not consist of equal length intervals; however, it should be chosen 
such that the outputs associated with all the (analog) states of P whose angular coordinates 
lie in any given interval are identical, in order to avoid introducing unnecessary uncertainty. 

In particular, consider a partition of the unit circle consisting of intervals h, . . . , I 2n where 
Ii = [ai, a i+ \) for some sequence of angles «!<...< a 2n +i satisfying: 

cui = tan~ x ^ — — j , cti G [0, tc) 
a n+ \ = a i + ii 
«2n+l = cti + 2ir 

The number and choice of angles is a design parameter here. 
Construct a set Q of intervals on the unit circle: 

Q = Si u s 2 u . . . u s 2n 

where Sk is the set of all "k adjacent intervals". That is: 

51 = {h, h, ■ ■ ■ , hn} 

5 2 = {/ 1 U/ 2 ,/ 2 U/ 3 ,...,/2nU/i} 

5 3 = {/ 1 U/ 2 U/ 3l / 2 U/ 3 U/ 4) ..,/ 2n U/ 1 U/ 2 } 



>2n 



{h U h U . . . U I 2n } 



This set Q, consisting of 2n(2n — 1) + 1 distinct elements, is the set of all potential states of 
M. It will become clear shortly that continuity of f u rules out the union of two non-adjacent 
quantization intervals from being a potential state of M in this setup. 
For q G Q, let 



and 



q n [ai, a n+ i) if sign(c'/3(a 2 )) 

qn[a n+1 ,a 2n+1 ) otherwise 

q n [ai, a n+1 ) if sign(c'f3(a 2 )) 

q n [a n+1 , a 2n+ i) otherwise 
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Note that Vi(q) G Q and V-\(q) G Q, by construction. For u G U, q G <2, let 

-l//«2(/?(?))\ 



and 



i; = {«6{l,..., 2n}|g« n [a*, a i+1 ) ^ 0}. 



Remark 4: It follows from the continuity of f u that for any choice ofueU and g G Q, 
q!t is a single connected interval. Thus 



y [ai,ai + i) 



is an interval of the form [aj, in other words, an element of Q. 
The dynamics of M are then given by 

g(t + l) = f(q(t),u(t),y(t)) 

m = g(q(t)) (13) 

v(t) = h(q(t),u(t)) 
with f : Q xU x y ^ Q, g : Q ^ y and h : Q x U ^ V defined by 
f(q,u,y) = (J [a h a i+1 ) 



1 ifq = V 1 {q)ov\V 1 {q)\>\V- 1 {q)\ 

-1 if q = V- X (q) or \V x {q)\ < \V-x{q)\ 



a{q) = 

h(q,u) = sup log (\\f u (m)\ 

In particular, let q Q denote the potential state corresponding to the whole unit circle, that is 
q Q = [aii, «2n+i)- It is shown in the following Lemma that condition © is satisfied provided 
M is initialized to this state. 

Lemma 1: Consider a plant P and let M be the corresponding finite state machine defined 
by (TT3l) for some choice of integer n > and (admissible) angles a\, . . . , a 2n +i - If <?(0) = q , 
then for any input u G W z + and for any initial condition of P, the outputs v G V z+ of (M, A) 
and v G R z + of P satisfy ©. 

Proof: Let and g(t) denote the states of P and M, respectively, at time t. It follows 
from the construction of M that: 

1) 6{t) G q(t) => v(t) > v{t) 

2) 6(t) G q(t) 0(t + 1) G q(t + 1) 

We have 8(0) G [0, 2n) = [ai,ai + 27r) = [ai,a 2 n+i)- When M is initialized to g(0) = q Q , 
9(0) G q(0), hence v(0) > v(0) and ^(l) G q(l). The statement of the Lemma thus follows 
by induction on t. ■ 
Since the initial state of M will be fixed to q a , the actual states of M consist of those 
states in Q that are reachable from q a . The problem of computing the reachable subset Q can 
be recast as either one of two well-studied problems: (i) A one-to-all network shortest path 
problem, which can be efficiently solved (polynomial time in n) using any of the available 
shortest path algorithms (Dijkstra's, Bellman-Ford, . . .) [1], or (ii) the problem of computing 
the accessible states of an automaton, which can be efficiently solved by constructing the 
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transition tree of the automaton [16]. Thus any of a standard collection of algorithms can be 
used to compute Q; interested readers are referred to the above two references for details of 
the various algorithms. 

The dynamics of M are thus given by: 

q(t + l) = f(q(t),u(t),y(t)) 

v(t) = h(q(t),u(t)) 

where the state transition function and output functions are given by / 
9 = 9\q> h = Mqxu, respectively. 

C. Description of the Approximation Error 

In this section, a procedure for computing an upper bound on the gain of the error system A 
associated with a given plant P and a corresponding nominal finite state model M, constructed 
as described in the previous section, is presented. As before, Q refers to the actual states of 
M. 

Lemma 2: If there exists a function V : Q — > R and a 7 G (0, 1) such that 

V(f{q,u,y)) - V(q) < 1P {u) - d{q) (15) 
holds for all q G Q, u G U and y G y, where d : Q — > {0, 1} defined by: 

d(q) = l° if q = Vl (g) ° Vq = V -^ 
^ ' 1 1 otherwise 

then the error system A with M is initialized to q a satisfies © for that choice of 7. 

Proof: By summing up (IT5T ) along any state trajectory of M from t = to t = T, we 

get: 

T 

£>(«(*)) - <*(?(*)) > V(q(T))-V(q(0)) 

> mm(v( qi )-V(q 2 )) 

Hence, we have: 

T 

inf V 7 p( M (t)) - d{q{t)) > min (v(q x ) - V(q 2 )) > -00 

T>0 Qi,Q2 V / 

~ t=0 

It follows from Lemma Q] that when M is initialized to g(0) = q a , we have 9{t) G q(t) for 
all t, where 9 and q are the states of P and M, respectively. Thus, when q(t) = Vi(q(t)) or 
q{t) = V-i(q(t)), y(t) = y(t) and w(t) = 0. Otherwise w(t) G {0, 1}. Hence w(t) < d(q(t)) 
for all t and all feasible input/output signal pairs of A satisfy ([8]). ■ 

An upper bound for the gain of A can thus be computed by solving a linear program in 
which we minimize 7 such that (IT3T) holds for all q G Q, u G U and y G y. This linear 
program has N + 1 decision variables and 4iV inequality constraints, where N = card(Q). 

Remark 5: Recall that the approximation error A is a complex system with both continuous 
and discrete states. Thus the appeal of this approach to computing a gain bound for A is its 
simplicity and its computational efficiency. The downside of this approach is that it results in 
conservative gain bounds, for two reasons: 



(14) 

= flaxuxy and 



11 



1) It inherently assumes that an error occurs every single time it can. 

2) It assumes that all signal pairs (u, y) G W z + x y z + are valid input sequences for M, 
which is not the case since y is an output of P corresponding to u. 

Thus, in using this approach, we are in effect trading off simplicity and efficiency versus 
conservatism. 

IV. Controller Design 
A. Two Related Synthesis Problems 






Fig. 5. Robust full state feedback control design setup 



Consider the following two controller synthesis problems: 

Problem 1: Given a plant P as in ©, © and ©, design a controller K C x W z+ 
such that the feedback interconnection (P, K) shown in Figure Q] satisfies the performance 
objective 

T 

sup V (v(t) + r) < oo (4) 

1 - t=0 

for some R > 0, for any initial condition of P. 

Problem 2: Given a plant P as in (OQ), © and © and a finite state approximation 
M of P constructed as described in Section IIII-BI with the corresponding error system 
satisfying gain condition ® for some 7 G (0, 1). Design a full state feedback control 
law i-p : Q —7- U such that the interconnection (M, A, ip) shown in Figure |5] satisfies the 
auxiliary robust performance objective 

T 

sup V (v(t) + r) < 00 (9) 
1 - t=o 

for some R > 0, for all systems A satisfying gain condition ©. 
Problem 1 is the original problem of interest. Satisfaction of performance objective © 
guarantees that the state of the closed loop system globally exponentially converges to the 
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origin, on average, at a rate not less than R. To see that, note that this performance objective 
can be equivalently re-written as 

\\x(t)\\ < k(x(0))10- Rt \\x(0)\\, Vt > 0,x(0) G R 2 

where k{x{0)) = 1Q S ( X (°))- R with 



S(x(0) 



sup y2(v(t) 



R) 



when the plant is initialized to x(Q). The largest value of R for which © holds is then 
the guaranteed rate of exponential convergence; the actual rate of convergence may be 
(significantly) better. 

Problem 2 is of interest as its solution provides a solution for Problem 1 . In particular, let 
ip be a full state feedback controller such that (M, A, ip) satisfies © for some R > 0, for 
all admissible A. Recall that by construction, the outputs v of (M, A) and v of P satisfy 
CO) whenever the same input u G W z + drives the plant P and the interconnection (M, A). 
Thus, to ensure that © holds for interconnection (P, K) whenever © holds, it is sufficient to 
ensure that the controller K connected in feedback with plant P is identical to the subsystem 
with input y and output u in the interconnection (M,A,ip). The structure of the resulting 
controller K is shown in Figure [6] K thus consists of M, a deterministic finite state machine 
"observer" for the plant and (p, a corresponding full state feedback control law. 

r 1 



. U 1 




« q 


M 


-< — 




1 






— 



















K\ 



Fig. 6. The finite state stabilizing controller K 



B. Design of a Full State Feedback Robust Switching Law 

The following problem is addressed in this section: Given a deterministic finite state 
machine M C x U z+ ) x (U z+ x V z+ x y z +) with state set Q, a scalar 7„ G (0, 1), and 

an uncertainty class 

A 7o = | A C U z+ x W z+ A satisfies © for 7 = 7o |. 

Design a robust switching law ip : Q -^U such that the closed loop system (M, A, ip) shown 
in Figure \5\ satisfies the auxiliary robust performance objective 

T 

sup V (v(t) + R) < 00 (9) 

1 - t=0 

for some R > 0, for all admissible uncertainty A G A 7o . 

5 The notion of exponential convergence considered here is slightly different than that of global exponential convergence 
defined in standard textbooks such as [14], [33]; in particular, we are not requiring the term k(x(0)) to be uniformly bounded. 
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The robust switching law <p will be designed using dynamic programming techniques and 
a "small gain" argument. Consider the feedback interconnection of two systems S and A as 
in Figure [7J The 'Small Gain Theorem' (Theorem [D and Corollary [T] presented here without 
proof, are directly adapted from Theorem 1 and Remark 4, respectively, in [31]. 

Theorem 1: (A 'Small Gain' Theorem) Suppose that S satisfies 



~ Vs{v{t),u{t)) > -oo 



(16) 



t=o 



for some p s : TZ x W — > R and : V x W -)• I, where W, V and W are finite sets, and 
that A satisfies 



infJ^PAKO) -fiA{w(t)) > -oo 



(17) 



t=o 



for some p A : U — > R and p A : VV — >■ R. Then the interconnected system (S 1 , A) with input 
r and output £> satisfies 



inf ^p{r{t)) -fi(y(t)) > -oo 



t=o 



for p : K R and p : V -»■ R defined by 



p(r) = max{p 5 (r, to) - p A (w)} 

ioEW 

/i({)) = mm{p s (v, u) - Pa(u)} 
ueu 



(18) 



w(t) e W 



r(t) G 7e 



A 



5 



u(t) G W 



u(t) G V 



Fig. 7. Setup for the 'Small Gain' Theorem. 



Corollary 1: The interconnected system satisfies (fT8l) for p : 7?. — > R and // : V — »• R 
defined by 

p(r) = max{p 5 (r,w) - rp A (w)} 

p(v) = mm{p s (v,u) - rp A (u)} 

for any scalar parameter r > 0. 

It follows from Theorem \T\ and Corollary Q] that the objective in © can be achieved by 
designing a switching law <p : Q — > U such that the feedback interconnection S = (M, <p) 
satisfies (fT6b with 

p s (r,w) = th(w)-R, (19) 

Ps(0, u) = -0 + TJ p(u) 
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for some R > and r > 0, for p and p defined as in ([8]). Note that the exogenous input r 
can be assumed to be constant here (representing the desired rate of convergence). 

With the auxiliary robust performance objective reformulated as a design objective for 
system S = (M,cp), the design of the full state feedback switching law can be carried out 
by solving a min-max optimization problem using standard dynamic programming techniques 
[3], [5]. The following theorem precisely formulates this approach: In this setting, function 
J and operator T play the role of the "cost-to-go" function and the "dynamic programming" 
operator, respectively. Value iteration is used to solve for the cost-to-go function, and the 
sought full state feedback switching law is then simply the optimizing argument. For the sake 
of completeness, a proof is outlined in the Appendix. 

Theorem 2: Consider a deterministic finite state machine M with state transition equation 

q(t+l) = f(q(t),u(t),w(t)) 

where Q is the state set and hi and W are the input sets. Let a : QxWxW->Kbea given 
function. The following three statements are equivalent: 

(a) There exists a <p : Q — > hi such that the closed loop system (M, </?) satisfies 

T 

M^2a(q(t),<p(q(t)), w(t)) > -oo. (20) 
- t=o 

(b) There exists a function J : Q — > M + such that the inequality 

J(q) > T(J(g)) (21) 
holds for any q e Q, for T : R s R Q defined by 

^(Jio)) = mrn max{— a(q, u, w) + J(f(q, u, w))\. (22) 

(c) The sequence of functions Jk : Q — > M, k E Z + , defined recursively by 

Jo = 

J fe+1 = max{0,T(J fe )} (23) 

converges. 

For the particular control problem of interest, the cost function 

a(q, u, w) = rp(w) — R — v(q, u) — Tj p(u) 

involves two parameters, the convergence rate R and the small gain scaling parameter r. 
Ultimately, the goal is to maximize R > for which there exists a J : Q — y R and a r > 
such that (f2T|) holds. However, since it is not possible to directly compute the optimal value 
of R, a numerical search is carried out yielding a suboptimal value of R: First, a range of 
values of r for which the design objective can be met when R = is computed. Different 
values of r are then sampled in this range to compute the largest feasible corresponding value 
of R, with the largest of those being the (suboptimal) guaranteed rate of convergence. 
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V. Illustrative Examples 

An academic example is presented in this section to illustrate the proposed design procedure. 
The example was chosen for its simplicity and amenability to analysis in the ideal (full state 
feedback) case, thus allowing us to numerically compare the performance of our finite state 
controllers in the binary sensing setting to the best achievable performance in the ideal setting 
for a given pair of plants. 

Consider a harmonic oscillator 



Xi 

x 2 



x 2 
kxi 



described in polar coordinates by 



r = 



9 



(l + k)rsm(9) cos(fl) 
(1 + k) cos 2 (#) - 1 



When k — — 1, r — and the state trajectories are concentric circles centered at the origin. 
For any k ^ — 1, r < in exactly two sectors of the state space given by 



6 e 



(0,7r/2)U(7r,37r/2) 
(7r/2,7r) U (37r/2,2vr) 



when k < — 1 
when k > — 1 



Thus in the ideal case where the switching controller has full access to the state and switching 
can occur at any time, it is always possible to stabilize a pair of harmonic oscillators with 
k — — 1 and G f \ {-1} by appropriately switching between them. 

Now suppose that the system is sampled and the only sensor information available for use 
by the controller is the sign of the position measurement. The stabilization problem becomes 
much more difficult because of the non-trivial state estimation problem that arises due to 
binary sensing and the time delays introduced by sampling. The approach described in this 
paper will be used to design a finite state stabilizing controller. 

The dynamics of the system, sampled at times tT s , t G Z+, for a given sampling period 
T s , are 



x(t + l) 

y(t) 

v(t) 



A s (u(t))x(t) 
sgn(xi(t)) 

\<t + i)\w 



log{- 



where A s (0), A s (l) are given by A s (u) = e A( - u)Ts with 

A(0)- 



1 
-1 



A(l) 



k D 



In this setting, applying inputs u = and u — 1 effectively corresponds to switching between 
"passive" and "aggressive" control modes, respectively. The sensor measurement y is assumed 
to be available at the beginning of each sampling interval [tTs, (t + l)T s ) at which point a 
plant (or equivalently, a value of k) is chosen by the controller K and held until the end of 
the sampling interval. The sampling interval T s is assumed to be a design parameter in this 
case. In order to counteract some of the conservatism introduced in the computation of an 
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TABLE I 

Explanation of Reported Parameters. 
n 2n is the total number of intervals partitioning the unit circle 

N Actual number of states of the deterministic finite state machine approximation (N < 2n(2n — 1) + 1) 

Ts Sampling interval 

7 Tightest verifiable gain bound for the approximation error 

R Guaranteed rate of exponential convergence of the sampled system 

p Number of iterations required for convergence of the Value Iteration algorithm 



TABLE II 
Data for ExampleQ] 



n 


N 


T S 


7 


R 


V 


5 


39 


0.6283 


0.75 


0.0160 


10 


10 


107 


0.3142 


0.625 


0.0267 


20 


15 


207 


0.2094 


0.5833 


0.0195 


28 


20 


331 


0.1571 


0.5625 


0.0152 


39 



upper bound for the gain of A, the unit circle is uniformly partitioned into In intervals and 
the sampling rate is matched to this partition, that is T s = -. This ensures that a partition 
interval maps to another partition interval in the "passive" control mode. 



Example 1: In particular, consider the case where k Q = — 3. The coarsest partition of the 
unit circle that gives rise to a good enough approximate model allowing for successful control 
design corresponds to n = 5. The relevant data for several choices of n are shown in Table 
II, and the implementation of the resulting controllers is plotted in Figure [H together with 
the corresponding plot in the "ideal" (full state feedback, unsampled) setting for comparison. 
Table I explains the significance of the various parameters reported for the examples. In this 
case, refining the partition by increasing n improves both the provable gain bound for A and 
the rate of convergence up to a certain point. The improvement seems to taper off beyond 
n = 15, likely due to numerical errors. Note that while R decreases for the cases where 
n — 15 and n = 20, the actual rate of convergence still improves as seen in the simulations, 
thus reinforcing the fact that the actual rate of convergence may be significantly better than 
the provable rate. 



Example 2: Now consider the case where k Q = 2. In this case, the coarsest workable 
partition of the unit circle giving rise to a good enough approximate model allowing for 
successful control design corresponds to n = 6. The relevant data for several choices of n are 
shown in Table III, and the implementation of the resulting controllers is plotted in Figure [9} 
together with the corresponding plot in the "ideal" setting for comparison. Note in this case 
that even though the best provable gain bound for the error system A remains unchanged 
between n = 6 and n = 7, the difference between v and v decreases as the unit circle partition 
is refined, hence the quality of approximation effectively improves as does the guaranteed rate 
of convergence. Also note that in this case the improvement stalls much sooner, around n = 8. 
This is likely due to the fact that the "aggressive" control law is much more aggressive than 
in the previous example, and hence numerical errors affect the computation sooner. 
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Fig. 8. Implementation of the DFM controller in Example 1 



TABLE III 
Data for Example[2] 



n 


N 


T S 


7 


R 


P 


6 


39 


0.5236 


1 


0.0141 


11 


7 


55 


0.4488 


1 


0.0387 


13 


8 


69 


0.3927 


1 


0.0279 


17 




Fig. 9. Implementation of the DFM controller in Example [2] 
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VI. CONCLUSIONS AND FUTURE WORK 

A new constructive approach for designing switching controllers to stabilize pairs of discrete- 
time homogenous second order systems under limited information (due to binary sensors) was 
presented. The hybrid plant is first approximated by a deterministic finite state machine, and 
a useable description of the resulting approximation error, in the form of a "gain" bound, is 
established. Using a "small gain" argument and dynamic programming techniques, a controller 
is then synthesized to robustly stabilize the nominal (approximate) model in the presence of 
admissible (approximation) uncertainty. The resulting stabilizing controller then consists of 
a deterministic finite state machine observer for the hybrid plant and a corresponding full 
state feedback switching law. While the proposed approach is inspired from classical robust 
control, the use of deterministic finite state machines as an alternative class of nominal models 
necessitates the development of a new set of complementary approximation, analysis and 
control tools. 

Future work will focus on three main directions: 

1) Reducing the conservatism of the approach: As mentioned in the paper, the proposed 
approach for computing gain bounds for the approximation uncertainty A is efficient 
(polynomial in the size of the nominal models) but conservative. One direction of future 
effort will thus focus on developing alternative tractable approaches for verifying tighter 
gain bounds, allowing for the design of stabilizing controllers with better performance. 

2) Extending the approach to broader problems: Another direction of future work will 
look into extending the proposed analysis, approximation and synthesis tools to broader 
classes of plants beyond homogeneous systems, in addition to identifying alternative 
performance objectives (beyond stability) that can be captured by "gain conditions". 

3) Addressing the issue of scalability: While the design approach presented here can, in 
principle, be applied to systems of arbitrary order, it scales poorly (exponentially) with 
plant order, as do most existing hybrid design techniques. Thus one important direction 
of future work is finding ways of improving the scalability of the approach, potentially 
by identifying and exploiting system structure. 
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Appendix I 
Proof of Theorem [2] 

Several statements will be useful in proving Theorem [21 The first one, Theorem [31 is adapted 
from Theorem 3 in [31] and is presented here without proof. 

Theorem 3: Consider a deterministic finite state machine M with state transition equation 

q(t + l) = f(q(t),u(t),w(t)) 

where Q is the state set and U and W are the input sets. Let a : QxU xW — > R be a given 
function. The following two statements are equivalent: 
(a) The inequality 

T 

inf ^2a(q(t),u(t),w(t)) > -oo 
~ t=o 
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is satisfied for any (u, w) G W z + x W z + and g(0) G Q. 
(b) There exists a non-negative function J : Q — >■ R + such that the inequality 

J(f(q,u,w)) - J(q) < a(q,u,w) 

holds for all q G Q, u EU and u> G W. 
In the terminology of Willem's theory of dissipative systems [35], J in Theorem [3] is the 
storage function of the dissipative system with supply rate a. 

Lemma 3: Function T defined in (l22l) is monotonic, that is 



Ji < h => T(Jx) < T(J 2 ). 
Proof: When Ji < J 2 , we have 

Ji(q) < Mq), Vg G Q 

a(q,u,w) + Ji(/(g,w, w)) < a(q,u,w) + J 2 (f(q,u,w)),Vq G Q,u eU,w eW 
=>■ max{cr(g, w, if) + JAf(q,u,w))} < max{a(q,u,w) + J 2 (f(q,u,w))},Wq G Q,u EU 

=>- minmax{cr(g, w, w) + Ji(f(q,u,w))} < minmax{cr(g, w, u>) + J 2 (f(q, u, w))}, Vg G <2 
«gw mew «gw mew 

=>• T(Ji)(g) < T(J 2 )(g),Vg G Q 

Thus T(Ji) < T(J 2 ). ■ 
Lemma 4: The sequence {J fc } defined in (T23T) is monotonically increasing. 
Proof: The proof is by induction on k. We have 

Jx = max{0, T( J )} > = J . 

Now suppose Jfc > Jfe-i. Then 

J fc+ i = max{0,T(J fe )} 

> max{0,T(J fc _ 1 )} 

> Jk 

which completes the proof. ■ 
Given a scalar c G R and a function J : Q — > R, denote by J + c the function from Q to 

R defined by (J + c)(g) = J(g) + c. 

Lemma 5: For any J : Q R and c G R, T( J + c) = T(J) + c. 
Proof: For any g G Q we have 

T((J + c)(g)) = minmax{cr(g, u, u;) + (J + c)(/(g, u, w))} 

= min maxicxfg, u, w) + J (f(q, u, w)) 4~ c} 

= min max{cx(g, u, w) + J(f(q, u, w))} + c 

= T(J(g))+c 

■ 

We are now ready to prove Theorem [2] 
Proof: (Theorem |2|) 
(b) =>• (a): Suppose there exists a function J : <2 — > R satisfying (|2T|) . and let 

V9(g) = argmin < max ( — <r(g, u, w) + J(f(q, u, w)) ) >. 
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For any q G Q, we have 

J(q) > T(J(g)) 

= minmax{— a(q, u, w) + J(f(q, u, w))} 

= max{-a(q,<p(q),w) + J(f(q, <p(q), w))} 

> -a(q,ip(q),w) + J(f(q,(f(q),w)),\/w G W. 

It follows from Theorem [3] that the (deterministic finite state machine) closed loop system 
(M, <p) satisfies (1201) . 

(a) => (c): Suppose there exists a tp such that the closed loop system satisfies (|20l ). By 
Theorem [3l there exists a J : Q — >• M + such that 

w)) - J{q) < cr(q, <p{q),w),Vq EQ,wEW 

We then have 

J(q) > -a(q,ip(q),w) + J(f(q,ip(q),w)),VqeQ,weW 
> max{-(r(g,^(g),U7) + J(f(q,<p(q),w))},Vq G Q 

= minmaxi— a(q, u, w) + J(f(q, u, w))}, Wq G Q 

= T(J(g)),VgGQ 

It follows that function J + c also satisfies 

(J + c) > T((J + c)) 

for any choice of c G 1. Since the set Q is finite, we can assume, without loss of generality, 

that J > with min J(q) = and max J(q) = m > 0. 

i i 

Moreover, the sequence {J k } defined in (1231) is bounded above by J. The proof is by 
induction on k. We have 

J = < J 

Suppose that Jk < J. Then 

T(J fc ) < T(J) < J 
where the first inequality follows from Lemma [3] and 

J k+1 = max{0,T(J fc )} < J. 

Sequence {J k } is thus monotonically increasing (Lemma HI) and bounded above by J. Hence 
it converges to J* = lim J k < J. 

k— too 

(c) =>• (b): Suppose that the sequence {J k } converges to J* = lim J k and let 

fc— ^oo 

e k = max{J*(g) - J k {q)}. 

Note that e k > 0, the sequence {e k } is monotonically decreasing (follows from Lemma S]) 
and lim e k = 0. Moreover 

fc— >oo 

Jk{q) > J*(q)-e k ,yqeQ. 
It follows from Lemmas [3] and \5\ that 

T(Jft) > T(J* - e fc ) = T(J*) - e k . 
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But we have 

J fc+ i = max{0,T(J fe )} 

> T(J k ) 

> T(J*)-e k . 

Thus 

J* = lim J k+1 > T(J*) - lim e k = T(J*). 

fc— >oo k^foo 
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